Is this Model for Real? Simulating Data to Reveal the Proximity of a Model to Reality

نویسندگان

  • Rinat B. Rosenberg-Kima
  • Zachary A. Pardos
چکیده

Simulated data plays a central role in Educational Data Mining and in particular in Bayesian Knowledge Tracing (BKT) research. The initial motivation for this paper was to try to answer the question: given two datasets could you tell which of them is real and which of them is simulated? The ability to answer this question may provide an additional indication of the goodness of the model, thus, if it is easy to discern simulated data from real data that could be an indication that the model does not provide an authentic representation of reality, whereas if it is hard to set the real and simulated data apart that might be an indication that the model is indeed authentic. In this paper we will describe analyses of 42 GLOP datasets that were performed in an attempt to address this question. Possible simulated data based metrics as well as additional findings that emerged during this exploration will be discussed.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Spatio-temporal agent based simulation of COVID-19 disease and investigating the effect of vaccination (case study: Urmia)

Proper management of epidemic diseases such as Covid-19 is very important because of its effects on the economy, culture and society of nations. By applying various control strategies such as closing schools, restricting night traffic and mass vaccination program, the spread of this disease has been somewhat controlled but not completely stopped. The main goal of this research is to provide a f...

متن کامل

Developing a model for simulating urban expansion based on the concept of decision risk: A case study in Babol city

Today, the study of the spatial-temporal pattern of urban physical expansion and the identification of the parameters affecting the expansion play a crucial role in urban-related decision-making and long-term planning processes. Consequently, the use of precise and efficient methods to predict the physical expansion of urban areas is of great importance. The objective of present study is to pro...

متن کامل

Robust DEA under discrete uncertain data: a case study of Iranian electricity distribution companies

Crisp input and output data are fundamentally indispensable in traditional data envelopment analysis (DEA). However, the real-world problems often deal with imprecise or ambiguous data. In this paper, we propose a novel robust data envelopment model (RDEA) to investigate the efficiencies of decision-making units (DMU) when there are discrete uncertain input and output data. The method is based ...

متن کامل

Monitoring and Diagnosing Multistage Processes: A Review of Cause Selecting Control Charts

A review of the literature on cause selecting charts (CSCs) in multistage processes is given, with a concentration on developments which have occurred since 1993. Model based control charts and multiple cause selecting charts (MCSCs) are reviewed. Several articles based on normally and non-normally distributed outgoing quality characteristics are analyzed and important issues such as economic d...

متن کامل

What is information? with an Emphasis on Zins’s Views

Purpose: The aim of this paper is to explore some important views and theories of information and knowledge in the area of library and information science (LIS) and then to propose a new model of information and knowledge. Method: This research has been conducted by using a comparative content analysis of the existing materials on conceptualizations and definitions of information and knowledge...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015